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SPECIFICATION 



MAMMALIAN GENES INVOLVED IN CIRC AD IAN PERIODS 



5 Technical Field 

The present invention relates to mammalian genes whose 
expression changes with a circadian period. 

Background Art 



behavioral processes in various organisms ranging from 
microorganisms to vertebrates exhibit circadian rhythms ( Edmunds , 
L. N . J., Cellular and Molecular Basis of Biological Clock, 
Springer-Verlag, New York, 1988). Several genes have been 

15 suggested to be involved in circadian rhythms. 

For example, two mammalian circadian clock mutations have been 
confirmed thus far. They are Clock of the mouse (Vitaterna, M. H-, 
et al., Science 264: 719-725, 1994) and tau of the hamster (Ralph, 
M. R. and Menaker, M-, Science 241: 1225-1227, 1988). The Clock 

20 gene has recently been identified and is believed to encode a 
transcription factor in the circadian clock (Moor, R . Y . andEichler, 
V. B . / Brain Res. 42: 201-206: 1972; Stephan, F. K. and Zucker, I., 
Proc. Natl. Acad. Sci. USA 69: 1583-1586, 1972). On the other hand, 
the tau gene has not yet been cloned . 

25 The period (per) gene has been isolated from Drosophila as 

a gene necessary for the expression of circadian rhythms for 
locomotive activities and eclosion behavior (Konopka, R. J. and 
Benzer, S., Proc. Natl. Acad. Sci. USA 68: 2112-2116, 1971). In 
the brain of the fly the oscillation of the levels of the per mRNA 

30 and of the PERIOD (dPER) protein are thought to determine the rhythms 
(Hardin, P. E., et al., Nature 343: 536-540, 1990; Zerr, D. M. , et 
al., J. Neurosci. 10: 2749-2762, 199^. However, per homologues 
in other organisms than insects have not been identified - 
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Many biochemical processes, physiological processes, and 



35 



Disclosure of the Invention 

An object of the present invention is to provide novel 
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mammalian proteins and the genes thereof that are involved in the 
circadian period. More specifically, the object is to provide 
mammalian proteins and the genes thereof that are functionally 
equivalent to those of the Drosophlla period (per) gene product. 



a region expected to play a functionally important role within the 
Drosophila gene known to be involved in the circadian rhythms, and 
performed a type of PCR, which had been developed on our own, using 
the primers des igned based on the sequence of the region . As a result , 

10 we succeeded in isolating a human gene that corresponds to the 
above-mentioned Drosophila gene. We also succeeded in isolating 
a mouse gene that corresponds to the human gene by using the isolated 
human gene as a probe. Furthermore, we analyzed structures of the 
proteins encoded by the human and the mouse genes thus isolated and 

15 discovered that these proteins highly conserve the functional 
domains and the structural domains that have been identified in the 
Drosophila protein- In addition, analysis of the expression of the 
isolated mouse gene in the suprachiasmatic nucleus, which is the 
region responsible for functioning as a circadian pacemaker in the 

20 mammalian brain, revealed that the expression of the gene fluctuates 
with a circadian period. 

Namely, the present invention relates to proteins and the 
genes thereof that are involved in the circadian periods of mammals, 
and more specifically to 

25 (1) a protein derived from a mammal whose expression level in the 
suprachiasmatic nucleus (SCN) fluctuates with a circadian period, 

(2) a protein of (1) wherein the mammal is a human, 

(3) a protein of (1) wherein the mammal is a mouse, 

(4) a protein involved in the formation of circadian rhythm in 
30 the suprachiasmatic nucleus (SCN) comprising the amino acid 

sequence described in SEQ ID NO: 1 or said sequence in which one 
or more amino acids are substituted, deleted, or added, 

(5) a protein involved in the formation of circadian rhythm in 
the suprachiasmatic nucleus (SCN) comprising the amino acid 

35 sequence described in SEQ ID NO: 2 or said sequence in which one 



5 



To attain the above object, the present inventors focused on 



or more amino acids are substituted, deleted, or added, 




(6) a protein involved in the formation of circadian rhythm in 
the suprachiasmatic nucleus (SCN) encoded by the DNA having a 
sequence described in SEQ ID NO: 3 or by DNA that hybridizes with 
the DNA described in SEQ ID NO: 3, 
5 (7) a protein involved in the formation of circadian rhythm in 
the suprachiasmatic nucleus (SCN) encoded by the DNA having a 
sequence described in SEQ ID NO: 4 or by DNA that hybridizes with 
the DNA described in SEQ ID NO: 4, 

(8) DNA encoding any of the proteins of (1) to (5), 
10 (9) DNA having the sequence described in SEQ ID NO: 3 or DNA that 
hybridizes with the DNA having the sequence described in SEQ ID 
NO : 3 , wherein the DNA encodes a protein involved in the formation 
of circadian rhythm in the suprachiasmatic nucleus (SCN), 

(10) DNA having the sequence described in SEQ ID NO: 4 or DNA that 
15 hybridizes with the DNA having the sequence described in SEQ ID 

NO: 4, wherein the DNA encodes a protein involved in the formation 
of circadian rhythm in the suprachiasmatic nucleus (SCN), 

(11) a vector carrying any of the DNA of (8) to (10), 

(12) a transformant expressibly retaining any of the DNA of (8) 
20 to (10), and 

(13) a method for producing any of the proteins of (1) to (7), the 
method comprising culturing the transformant of (12). 

Herein, the "circadian periods" means the activity rhythms 
with a period of approximately 2 4 hours which are observed in a wide 
25 variety of behaviors such as endocrine secretions and body 
temperature, blood pressure, sleep-wakef ulness , and others of an 
organism. 

The expression of the protein of the present invention 
oscillates autonomously with a circadian period in the 

30 suprachiasmatic nucleus (SCN) , which is a major circadian pacemaker 
of the mammalian brain (Moor, R. Y. and Eichler, V. B., Brain Res. 
42: 201-206: 1972; Stephan, F. K . and ZucJcer, I., Proc. Natl. Acad. 
Sci. USA 69: 1583-1586, 1972). The amino acid sequences of the 
proteins derived from the human and the mouse included in the present 

35 invention are shown in SEQ ID NO: 1 and SEQ ID NO: 2, respectively. 
The amino acid sequences of these two mammalian proteins fairly 
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homologous with that of the Drosophlla protein (the period gene 
product) (Citri, Y., et al . , Nature 326: 42-47 , 1987). The period 
gene is required for the expression of the circadian rhythms of 
locomotive activities and hatching behavior in Drosophila (Konopka, 
R.J. and Benzer, S. , Proc . Natl, Acad. Sci. USA 68: 2112-2116, 1971). 
The oscillations of its mRNA and protein levels in the fly brain 
are thought to determine the rhythms (Hardin, P. E . , et al., Nature 
343: 536-540, 1990; Zerr, D. M. , et al., J. Neurosci. 10: 2749- 
2762, 1990). These two proteins show highly homologous with the 
Drosophlla protein in the PAS domains which have been suggested to 
be structurally and functionally important based on the genetic and 
biochemical studies (Baylies, M. K . et al . , Nature 326: 390-392, 
1987 ; Saez, L. and Young, iM. W. , Neuron 17: 911-920, 1996). 

Recently King et al. have cloned the mammalian "Clock" gene, 
which encodes a bHLH-PAS-polyQ polypeptide (King, D. P., et al., 
Cell 89: 641-653, 1997; Antoch, M . P., et al . , Cell 89: 655-667, 
1997 ). The proteins of the present invention can form dimers with 
other molecules such as "CLOCK" by means of the PAS-PAS interaction 
in the circadian clock system. 

The proteins of the present invention can be prepared as a 
recombinant protein utilizing the genetic recombinant technology, 
or as a natural protein. A recombinant protein can be prepared by 
culturing the cells transformed with DNA encoding the protein of 
the present invention as described later. A natural protein can 
be isolated, for example, from the somatic cell tissues, such as 
brain, pancreas, kidney, skeletal muscle, liver, lung, placenta, 
heart, spleen, and testis using an affinity column with an 
appropriate carrier bound to an antibody that is prepared using the 
above-mentioned recombinant protein of the present invention. 

It is possible for a person skilled in the art to prepare a 
protein substantially identical to the protein described in SEQ ID 
NO: 1 or SEQ ID NO: 2 by making amino acid substitutions and other 
modifications to the protein described in SEQ ID NO: 1 using known 
methods. Mutations of amino acids in a protein may also occur 
spotaneously . Thus, the present invention includes modified 
proteins that result from the modification of amino acids of the 
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protein described in SEQ ID NO: 1 or 2 by substitution, deletion, 
or addition, and are involved in the formation of circadian rhythms 
in the suprachiasmatic nucleus (SCN) • The known methods to modify 
amino acids include the ODA ( Oligonucleotide-directed Dual 
5 Amber ) -LA PCR method (Hashimoto-Gotoh, T. , et al. , Gene 152: 271-2 75, 
1995 ) * The amino acids to be substituted are usually within 10 amino 
acids, preferably within 6 amino acids, and more preferably within 
3 amino acids . 



10 that are substantially functionally equivalent to the protein 
described in SEQ ID NO: 1 or 2 from DNAs that are highly homologous 
with the DNA having a sequence described in SEQ ID NO: 3 or 4 and 
isolated from other organisms using such methods as the known 
hybridization technique (Church, G. M - and Gilbert, W. , Proc. Natl. 

15 Acad. Sci- USA 81: 1991-1995, 1984; Sambrook, J . , et al., Molecular 
Cloning, 2 nd ed. , 1989) based on the DNA sequence described in SEQ 
ID NO: 3 or 4 (or part thereof). Thus the proteins encoded by the 
DNA that hybridizes with the DNA sequence described in SEQ ID NO: 
3 or 4 , which are involved in the formation of circadian rhythms 

20 in the suprachiasmatic nucleus (SCN), are also included in the 
proteins of the present invention- The source of the DNA for 
hybridization includes mammals such as rats, dogs, cats, monkeys, 
whales, cattle, pigs, and horses. The DNA encoding the proteins 
from these other organisms should usually highly homologous with 

25 the DNA described in SEQ ID NO: 3 or 4. "Being highly homologous" 
means having at least 60%, preferably at least 70%, more preferably 
at least 80%, and still more preferably at least 90% of sequence 
identity with the DNA described in SEQ ID NO: 3 or 4. The 
hybridization for isolating such DNAs can be performed, for example, 

30 in a mixture consisting of 6 x SSPE, 5 x Denhardt's solution, 0.5% 
SDS, 100 nl/ml denatured salmon sperm DNA, and 50% formamide, usually 
at 42°C, less stringently at 32°C, or more stringently at 65°C. 



proteins of the present invention described above. The DNAs 
35 encoding the proteins of the present invention can be cDNA, genomic 
DNA, or synthetic DNA. The DNAs of the present invention can be 



It is routine for one skilled in the art to obtain proteins 



The present invention also relates to DNAs encoding the 
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utilized, for example, to manufacture the proteins of the present 
invention as recombinant proteins. Namely, the DNA encoding a 
protein of the present invention (for example, the DNA described 
in SEQ ID NO: 3 or 4) is inserted into an appropriate expression 
5 vector, appropriate cells are transformed with the vector, the 
transf ormants are cultured, and the expressed protein is purified 
to prepare the proteins of the present invention as recombinant 
proteins . 

The preferred cells used for the production of the recombinant 
10 proteins include E. coli, yeast, insect cells, and animal cells - 
The vectors used to express the recombinant proteins within these 
cells include the pET system, pAUR system, baculovirus vectors 
(pBlue Bac, etc. ) , and the CMV or RSV promoter-driven vectors, etc. 
The transf ection of the vector into the host cell can be done, 
15 for example, by electroporation for E. coli and yeast, and the 
liposome* method for insect cells and animal cells. The lithium 
acetate method can also be used for yeast. 

The recombinant protein can be purified from the transf ormant, 
for example, by ion exchange, gel filtration, or anti-Per antibody 
20 column chromatography. 

The proteins or the DNAs of the present invention are 
applicable to treat disorders related to circadian rhythms, such 
as sleep phase delay syndrome, sleep phase progression syndrome, 
non-circadian sleep-wake syndrome, irregular sleep-wake disorder, 
25 and time difference syndrome (so-called jet lag). They are also 
applicable to the labor and health management of irregular night 
time workers and to prevention of night poriomania in dementia. 

Brief Description of the Drawings 
30 Figure 1 shows the amino acid sequences within the PAS repeats 

(arrows) that were used to design the primers for IMS-PCR. 

Figure 2 is a photograph showing an electrophoresis image of 
3 bp ladder markers that were electrophoresed on a 10% non-denaturing 
PAGE gel in a non-continuous buffer solution system. A 10 bp DNA 
35 ladder (BRL) was electrophoresed on lane M. 

Figure 3 is a photograph showing an electrophoresis image of 
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the IMS-PCR product (lanes marked with arrows) that was 
electrophoresed along with 59 bp, 65 bp, and 68 bp of the 3 bp ladder 
markers {lanes marked with asterisks). 

Figure 4 shows an amino acid sequence comparison among the 
5 PERIOD family members . hD I AL , mDIAL , and PERIOD indicate the human, 
the mouse, and the Drosophila version of PERIOD, respectively. 
Shaded or dotted boxes indicate homologous sequences, and CI through 
C6 indicate regions conserved among different Drosophila species. 

Figure 5 shows an amino acid sequence comparison among the 

10 PERIOD family members . hDIAL, mDIAL, and PERIOD indicate the human, 
the mouse, and the Drosophila version of PERIOD, respectively. 
Shaded or dotted portions indicate homologous sequences- Sequences 
corresponding to NLS, the PAS-A repeats, the PAS-B repeats, and CLD 
are underlined, and the TG repeats (the SG repeats in the human and 

15 mouse PER) are boxed. Amino acid identities between the human PERIOD 
and the mouse PERIOD are indicated by asterisks above the human 
PERIOD sequence. The identities and homologies between the 
mammalian PERIOD and the Drosophila PERIOD are indicated by 
asterisks and open circles below the Drosophila PERIOD sequence. 

20 Figure 6 is a photograph showing the northern blot analysis 

of hPER. hPER was bound to the filter as a probe, and then G3PDH 
was bound as a loading control. 

Figure 7 is a photograph showing the northern blot analysis 
of mPer. mPer was bound to the filter as a probe, and then G3PDH 

25 was bound as a loading control. 

Figure 8 is a photograph showing the results of In situ 
hybridization of mPer in the mouse brain under the LD (top) and the 
DD (bottom) conditions. SCN is indicated by arrows. The bar 
indicates 2 mm. 

30 Figure 9 shows the results of quantification of in situ 

hybridization data under the LD (top) and the DD (bottom) conditions. 
Each data point is the average ± SEM (n=5). ** indicates 
significance at the 1% significance level, and * at the 5% 
significance level, compared with the values at ZT16 and CT16. The 

35 white portion of the bar represents the light period, and the black 
portions the dark periods . 
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Figure 10 shows the results of the competitive RT-PCR analysis 
on the mPer mRNA under the LD (top) and the DD (bottom) conditions . 
AmPer indicates a competitive factor for mPer and A(3-actin indicates 
a competitive factor for p-actin. The white portion of the bar 
represents the light period, and the black portions the dark periods . 

Best Mode for Carrying out t.h«* I nvention 

The present invention is illustrated in detail below with 
reference to the following examples, but is not to be construed as 
being limited thereto. 

E xample 1 Isolation of the mamma l ian hnmologues of per 

In order to isolate the mammalian homologues of per, the 
inventors have developed a novel method, intramodule scanning (IMS) 
-PCR. The principle of the method is based on the fact that in the 
human genome short stretches of DNA sequences (modules) that encode 
short polypeptide fragments (motifs ) are scattered over long genomic 
distances. If a sufficient number of "intramodule scanning" 
primers are used to cover the entire length of a gene, the module 
can be screened with equal frequencies irrespective of their 
expression levels. 

Genetic and biochemical studies have suggested that the PAS 
domains in dPER are structurally and functionally important (Baylies, 
M. K. et al., Nature 326: 390-392, 1987; Saez, L. and Young, M. w. , 
Neuron 17: 911-920, 1996). Therefore, we designed 18 different 
primers corresponding to the internal sequences of the dPER PAS-A 
and PAS-B repeats (Figure 1 ) . The sequences of the degenerate primer 
pairs for the PAS-A and PAS-B repeats are as follows: 
GTGCTGGGCTACCCN ( A/C ) GNGA ; 
CTGGGCTACCCCC(A/G) ( A/ G ) GANATG ; 
GGCTACCCCC(A/G) ( A/G ) GANATGTGG; 
CTGGGCT ( A/T ) CCTGCCNCA ( A/G ) ; 
CTGGGCT ( A/T ) CCTGCCNCA ( A/G ) GA ; 
GGCTACCTGCC ( C / T ) CA ( A/G ) GAN ( C/T ) ; 
GCCCG ( G/A ) TCCTTCAG ( G/A ) TGNAC ; 
TCCTCATG (A/G) TGCAC (A/G) ( T/ A ) ANTC ; 




ATGTCCTCATG( A/ G ) TG ( C/G ) AC ( A/G ) ( A/T ) A ; and 
GACAC (A/G) TCCTCATG ( A/G ) TG ( A/G ) TA . 
Here, symbols such as A/G mean mixture primers between A and G. 

Since homologous polypeptides share common characteristics 
5 at the corresponding positions within the molecules, when the 
corresponding amino acid sequences are used for synthesizing PCR 
primers, the lengths of the PCR products reflect the characteristics 
of the domain structure in each polypeptide with respect to the 
positions. Considering the lengths of a codon (3 bp) and an exon 

10 (100 bp on average) in a human gene, we synthesized the 3 bp ladder 
markers (53 to 113 bp) by PCR using the series of primers and pUC18 
as the template. An electrophoretic image of these 3 bp ladder 
marker and a 10 bp DNA ladder marker (BRL) are shown in Figure 2. 
The markers were electrophoresed along with the PCR products side 

15 by side in a non-continuous buffer solution system (Ito, T. , Hohjoh, 
H. and Sakaki, Y., Electrophoresis 14: 278-282, 1993) on a non- 
denaturing PAGE (10%) gel (Figure 3). 

Each PCR mixture (Sambrook, J., et al., Molecular Cloning, 
Cold Spring Harbor Laboratory, 1989) contained 0.5 (j.g of human 

20 genomic DNA. The mixture was incubated at 94°C for 1 minute, and 
subjected to 3 cycles of [94°C for 30 seconds, 3 7°C for 3 0 seconds, 
and 72°C for 30 seconds] , followed by 25 cycles of [94°C for 30 seconds, 
45°C for 30 seconds, and 72°C for 30 seconds]. 

The DNA bands of expected lengths were cloned and their 

25 sequences determined. Among the 33 clones (59 to 74 bp) derived 
from the 12 bands that were produced by the nested PCR using a certain 
primer pair (corresponding to the peptide sequences 5 ' " GYLPQD " and 
3 ' "FVHHEDI" ) , the clones of 65 bp were especially amplified 6 to 
21 fold* It became clear that the genomic DNA sequence containing 

30 the 65 bp fragment has a 106 bp exon encoding 35 amino acid residues 
that are part of the PAS-B domain consisting of a total of 125 amino 
acids . We isolated the corresponding cDNA and named human PER (hPER) 
cDNA. Next, we cloned a mouse homologue (mPer) cDNA using the hPER 
cDNA as a probe. The nucleotide sequences determined are shown in 

35 SEQ ID NO: 3 for hPER, and SEQ ID NO: 4 for mPer. FISH revealed 
that the hPER gene and the mPer gene were located at 17pl2-13.1 and 
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11B, respectively, which are gene loci in synteny between the two 
species . 

The cDNA sequences of hPER and mPer contain ORF's that are 
expected to encode 1,290 amino acid residues and 1,291 amino acid 
5 residues, respectively. (See Figure 5, The putative amino acid 
sequence of the hPER gene product is shown in SEQ ID NO: 3, and that 
of the mPer gene product in SEQ ID NO: 4,) The amino acid identity 
between hPER and mPER is 92%, clearly indicating that hPER and mPer 
are conserved between the two species (Figure 5) • A homology search 

10 using the BLAST program on non-overlapping amino acid databases 
demonstrated that the two mammalian PER's showed the highest 
homology with dPER (type A) (Citri, Y • , et al., Nature 326:42-47, 
1987), Significant homologies between the mammalian PER and the 
Drosophila PER were concentrated on five domains (Figures 4 and 5) : 

15 I) N-terminal homologous regions (residues 44 to 131 of hPER and 
mPER) ; II) PAS-A (residues 217 to 282 for both homologues ) ; III) 
PAS-B (residues 338 to 456 for both homologues) and its immediate 
downstream sequence (residues 457 to 485 for both homologues); IV) 
a short segment corresponding to the downstream region from the site 

20 (residue 589) of the per S mutation (which shortens the circadian 
period) (residues 624 to 645 for both homologues); and V) regions 
homologous with the PER-C C-terminal region (residues 1006 to 1050 
for hPER and residues 1005 to 1049 for mPER) , subsequent 
serine-glycine (SG) repeats (residues 1051 to 1072 for hPER and 

25 residues 1050 to 1071 for mPER) , and further downstream homologous 
sequences (residues 1073 to 1108 for hPER and residues 1072 to 1107 
for mPER) . The homology in these regions are 44%, 47%, 56%, 64%, 
and 37%, respectively (Figure 4 ) . Although the PAS domains (regions 
II and III) of the PER homologues are fairly homologous to the 

30 corresponding region of dPER, other regions also show high 
homologies. Five structural domains and functional domains have 
been identified in dPER: a) the nuclear localization signal (NLS) 
(residues 66 to 79) (Vosshall, L, B. , et al. , Science 263 : 1606-1609, 
1996); b) the PAS domain (residues 233 to 490) necessary for dPER 

35 to interact with the NLS of TIM (Saez, L. and Young, M. W., Neuron 
17: 911-920, 1996); c) the cytoplasmic localization domain (CLD) 
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(residues 453 to 511 ) located downstream from the PAS -B repeats (Saez, 
L. and Young, M. W., Neuron 17: 911-9 20, 1996); d) the PER-C domain 
(residues 524 to 685) which interacts with the PAS domain in the 
self-polypeptide (Huang, Z, J., et al., Science 267: 1169-1172, 
5 1995); and e) the threonine-glycine (TG) repeats (residues 694 to 
748) and the immediate downstream region (residues 749 to 868) which 
control the rhythm of the species-specific mating song of Drosophila 
(Wheeler, D. A., et al., Science 251: 1082-1085, 1991). Thus, NLS, 
PAS, CLD, the two domains within PER-C, and the TG repeats and a 

10 segment next to its C-terminus in each mammalian PER are arranged 
in exactly the same order as in dPER. Interestingly, the TG repeats 
of dPER are replaced with short SG repeats in the C-terminal halves 
of the PER homologues (Figure 5). This segment, which is adjacent 
to PER-C, and the sequence homologous to the C-terminal side of the 

15 TG repeats are located approximately 350 bases downstream from the 
original locations in dPER ( Figure 4 ) . These regions are also highly 
conserved in both the human and the mouse (Figure 5). Six PER 
segments (C1-C6) that are highly conserved among different 
Drosophila species are seen (Figure 4) (Colot, H. V. , et al., EMBO 

20 J. 7: 3929-3937, 1988). Like in the silkmoth homologue of PER, the 
parts of the mammalian PER that are homologous with dPER are 
concentrated on the regions corresponding to C1-C3 of dPER (Figure 
4) (Reppert, S.M., et al., Neuron 13 : 1167-1176, 1994 ). Considering 
these observations, hPER and mPer are conclusively the structural 

25 homologues of per. 

Example 2 Expression of hPER and mPer 

The expression patterns of hPER and mPer were examined by 
northern hybridization according to the method of Church and Gilbert 

30 (Church, G. M. and Gilbert, W., Proc . Natl. Acad. Sci. USA 81: 
1991-1995, 1984). The filters were purchased from Clontech. The 
results are shown in Figure 6 (hPER) and Figure 7 (mPer) . The 
expression product of approximately 4.6 kb was detected in all the 
tissues tested from the adult human and the mouse. However, the 

35 levels of the hPER/mPer transcription product are not uniform as 
compared with those of glycerol-3-phosphate dehydrogenase (G3PDH) , 
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which is an enzyme in the glycolytic pathway and is abundantly and 
relatively constantly expressed in every cell- The wide 
distribution of the hPER/mPer expression is not surprising because 
in Drosophila the per expression has been detected in many tissues 
5 except the brain (Liu, X., etal., Genes Dev. 2: 228-238, 1988; Saez, 
L. and Young, M • W. , Mol. Cell. Biol. 8: 5378-5385, 1988). 

Example 3 Distribution of the mPer cDNA in the mouse brain 

The distribution of the mPer cDNA in the mouse brain was 

10 examined by in situ hybridization. Continuous cortical sections 
(40 iim thickness) of the mouse brain were prepared in the cryostat. 
In situ hybridization and determination of mRNA are described in 
the literature reference (e.g., Ban, Y . , Shigeyoshi, Y. and Okamura, 
H., J. Neurosci. 17: 3920-3931, 1997). The 33 P-labeled probes used 

15 in the hybridization were the sense and the antisense strands on 
the 5' side of the mPer cRNA (nucleotide positions 538-1752; data 
not shown) . After the signals were converted into relative optical 
concentrations using the 14 C-acrylic acid standard (Amersham, Inc. 
Pic. ) , the radioactivity was analyzed on each section on the BioMax 

20 film (Kodak) using a microcomputer connected to an image analyzer 
(MCID, Imaging Research, Inc.). These data were standardized 
against the difference in signal intensities between the equivalent 
regions of SCN and corpus callosum. The intensities of optical 
concentrations in the sections covering from the rostral end to the 

25 caudal end of SCN (10 pieces per mouse) were added, and the total 
was used as the measured value of the mPer mRNA quantity of this 
region. As a result, weak signals were detected from most brain 
areas including the cortical structures and non-cortical structures . 
Stronger mPer mRNA signals were detected from the pyramidal cell 

30 layer of piriform cortex, periventricular regions of the caudate 
putamen, many of the thalamic nuclei, and the granular layer of 
cerebellar cortex. Surprisingly, the highest mPer expression level 
in the brain was observed in SCN at a specific time (Figures 8 and 
9; explained below), 

35 In order to examine the time dependence of the mPer expression 

in SCN, mice were synchronized to an environment by keeping them 
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under the 12 h light/12 h dark (LD) conditions. The mPer mRNA was 
quantified by in situ hybridization and the competitive RT-PCR 
method. The competitive RT-PCR was performed as follows. First, 
we prepared mouse brain sections ( 0 . 5 mm thickness) in the "Mouse 
5 Brain Matrix" (Neuroscience, Inc., Tokyo). Using a microdissection 
needle (600 t um diameter), SCN was pressed out laterally 
symmetrically from the frozen sections under a stereoscopic 
microscope. Total RNA was extracted from SCN (n=4) using TRIZOL 
solution (BRL), treated with DNase I ( Stratagene ) , and purified 

10 using TRIZOL LS solution (BRL). "SUPERSCRIPT Preamplif ication 
System" (BRL) was used to reverse-transcribe approximately 1 u.g of 
RNA, and the cDNAs of mPer and P-actin were quantified by the 
competitive PCR method. The PCR products were electrophoresed on 
a non-denaturing PAGE gel (5.5%), stained with "SYBR Green" 

15 (Molecular Probes) , and the DNA in appropriate bands was quantified 
with "FMBI011 fluoroimage analyzer" (Hitachi) . The competitive DNA 
fragments for mPer and p-actin were constructed by making internal 
deletions in the respective cDNAs . mPer, mPer competitive factor, 
P-actin, and p-actin competitive factor were 4 82 bp, 24 6 bp, 122 8 

20 bp, and 104 4 bp, respectively. 

These two methods (in situ hybridization and the competitive 
RT-PCR method) produced similar oscillation profiles in LD (Figures 
8 and 10; upper panels). The mPer mRNA quantity reached a peak in 
the light condition (from ZT4 to ZT8 ; ZT indicates the time under 

25 the LD condition as in Figures 8 to 10), and fell to a minimum in 
the dark condition (from ZT16 to ZT2 0) (Figure 9; upper panel). 
Moreover, under the constant dark condition (DD), there were 
free-run changes (Figures 8 and 10; lower panels) , in which the mPer 
mRNA levels reached a peak between CT4 and CT8 (CT indicates the 

30 time under the DD condition as in Figures 8 to 10) and fell to a 
minimum between CT16 and CT2 0 (Figure 9; lower panel). The mPer 
mRNA in SCN is expressed with a strong and autonomous circadian 
period under the constant dark condition as described above, 
suggesting that this gene functions as a circadian rhythm pacemaker. 

35 Changes of the mPer mRNA in SCN with a circadian rhythm resemble 
the nervous activities in this brain region (Inouye, S-T. and 
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Kawamura, H . , Proc. Natl. Acad. Sci. USA 76: 5962-5966, 1979; 
Schwarts, W. J. and Gainer, H., Science 197: 1089-1092, 1977; 
Gillette, M. U. and Reppert, S. M. , Brain Res. Bull. 19: 135-139, 
1987 ) , reaching a peak in the daytime and falling to a minimum during 
5 the night. mPer may function as a controlling factor of the nervous 
activities in SCN. 



Industrial Applicability 

The present invention provides novel mammalian proteins and 

10 their genes involved in the circadian period. The proteins and the 
DNAs of the present invention are expected to be able to correct 
abnormalities of the circadian rhythm in the mammals, and would thus 
be useful for treating disorders related to circadian rhythms, such 
as sleep phase delay syndrome, sleep phase progression syndrome, 

15 non-circadian sleep-wake syndrome, irregular sleep-wake disorder, 
and time difference syndrome (so-called jet lag). They are also 
applicable to the labor and health management of irregular night 
time workers and to the prevention of such disorders as night 
poriomania in dementia. 
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CLAIMS 

^ 1. A protein derived from a mammal whose expression level in the 
suprachiasmatic nucleus (SCN) fluctuates with a circadian period- 
5 2. -'A protein of claim 1, wherein the mammal is a human. 

3, -A protein of claim 1, wherein the mammal is a mouse. 
J 4. A protein involved in the formation of circadian rhythm in 

the suprachiasmatic nucleus (SCN) comprising the amino acid sequence 
described in SEQ ID NO: 1 or said sequence in which one or more amino 
10 acids are substituted, deleted, or added. 
J 5. A protein involved in the formation of circadian rhythm in 
the suprachiasmatic nucleus (SCN) comprising the amino acid sequence 
described in SEQ ID NO: 2 or said sequence in which one or more amino 
acids are substituted, deleted, or added. 
15 J 6. A protein involved in the formation of circadian rhythm in 
the suprachiasmatic nucleus (SCN) encoded by the DNA having a 
sequence described in SEQ ID NO: 3 or by DNA that hybridizes with 
the DNA described in SEQ ID NO: 3. 
/ 7 . A protein involved in the formation of circadian rhythm in 
20 the suprachiasmatic nucleus (SCN) encoded by the DNA having a 
sequence described in SEQ ID NO: 4 or by DNA that hybridizes with 
the DNA described in SEQ ID NO: 4. 

8. DNA encoding the protein of any one of claims 1 to 5. 

9. DNA having the sequence described in SEQ ID NO: 3 or DNA that 
25 hybridizes with the DNA having the sequence described in SEQ ID NO: 

3, wherein the DNA encodes a protein involved in the formation of 
circadian rhythm in the suprachiasmatic nucleus (SCN). 

10. DNA having the sequence described in SEQ ID NO: 4 or DNA that 
hybridizes with the DNA having the sequence described in SEQ ID NO: 

30 4, wherein the DNA encodes a protein involved in the formation of 
circadian rhythm in the suprachiasmatic nucleus (SCN). 

11. A vector carrying the DNA of any one of claims 8 to 10. 

12. A transformant expressibly retaining the DNA of any one of 
claims 8 to 10. 

35 13. A method for producing the protein of any one of claims 1 to 

7, the method comprising culturing the transformant of claim 12. 
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ABSTRACT 

A human gene and a mouse gene corresponding to Drosophlla 
period gene which is known to be involved in the circadian period. 
5 The proteins and DNAs are applicable to the treatment of diseases 
relating to the circadian rhythm such as sleep phase delay syndorom, 
sleep phase progression syndrom, non-circadian sleep-wake syndrome, 
irregular sleep-wake disorder, and time difference syndrome 
(so-called jet lag), and to the labor and health management of 
10 irregular night time workers and the prevention of such disorders 
as night poriomania in dementia • 
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SEQUENCE LISTING 

<110> SAKAKI, Yoshiyuki 

<120> Mammalian Genes Involved in Circadian Periods 

<130> SEN-903PCT 

<150> JP 9-267846 
<151> 1997-09-12 

<160> 4 

<210> 1 
<211> 1290 
<212> PRT 

<213> Homo sapiens 
<400> 1 

Met Ser Gly Pro Leu Glu Gly Ala Asp Gly Gly Gly Asp Pro Arg Pro 
15 10 15 

Gly Glu Ser Phe Cys Pro Gly Gly Val Pro Ser Pro Gly Pro Pro Gin 
20 25 30 

His Arg Pro Cys Pro Gly Pro Ser Leu Ala Asp Asp Thr Asp Ala Asn 
35 40 45 

Ser Asn Gly Ser Ser Gly Asn Glu Ser Asn Gly His Glu Ser Arg Gly 
50 55 60 

Ala Ser Gin Arg Ser Ser His Ser Ser Ser Ser Gly Asn Gly Lys Asp 
65 70 75 80 

Ser Ala Leu Leu Glu Thr Thr Glu Ser Ser Lys Ser Thr Asn Ser Gin 
85 90 95 



Ser Pro Ser Pro Pro Ser Ser Ser He Ala Tyr Ser Leu Leu Ser Ala 
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100 



105 



110 



Ser Ser Glu Gin Asp Asn Pro Ser Thr Ser Gly Cys Ser Ser Glu Gin 
115 120 125 

Ser Ala Arg Ala Arg Thr Gin Lys Glu Leu Met Thr Ala Leu Arg Glu 
130 135 140 

Leu Lys Leu Arg Leu Pro Pro Glu Arg Arg Gly Lys Gly Arg Ser Gly 
145 150 155 160 

Thr Leu Ala Thr Leu Gin Tyr Ala Leu Ala Cys Val Lys Gin Val Gin 
165 170 175 

Ala Asn Gin Glu Tyr Tyr Gin Gin Trp Ser Leu Glu Glu Gly Glu Pro 
180 185 190 

Cys Ser Met Asp Met Ser Thr Tyr Thr Leu Glu Glu Leu Glu His He 
195 200 205 

Thr Ser Glu Tyr Thr Leu Gin Asn Gin Asp Thr Phe Ser Val Ala Val 
210 215 220 

Ser Phe Leu Thr Gly Arg lie Val Tyr He Ser Glu Gin Ala Ala Val 
225 230 235 240 

Leu Leu Arg Cys Lys Arg Asp Val Phe Arg Gly Thr Arg Phe Ser Glu 
245 250 255 

Leu Leu Ala Pro Gin Asp Val Gly Val Phe Tyr Gly Ser Thr Ala Pro 
260 265 270 



Ser Arg Leu Pro Thr Trp Gly Thr Gly Ala Ser Ala Gly Ser Gly Leu 
275 280 285 



Arg Asp Phe Thr Gin Glu Lys Ser Val Phe Cys Arg He Arg Gly Gly 
290 295 300 



3/31 



SPO-108 



Pro Asp Arg Asp Pro Gly Pro Arg Tyr Gin Pro Phe Arg Leu Thr Pro 
305 310 315 320 

Tyr Val Thr Lys He Arg Val Ser Asp Gly Ala Pro Ala Gin Pro Cys 
325 330 335 

Cys Leu Leu He Ala Glu Arg He His Ser Gly Tyr Glu Ala Pro Arg 
340 345 350 

He Pro Pro Asp Lys Arg lie Phe Thr Thr Arg His Thr Pro Ser Cys 
355 360 365 

Leu Phe Gin Asp Val Asp Glu Arg Ala Ala Pro Leu Leu Gly Tyr Leu 
370 375 380 

Pro Gin Asp Leu Leu Gly Ala Pro Val Leu Leu Phe Leu His Pro Glu 
385 390 395 400 

Asp Arg Pro Leu Met Leu Ala He His Lys Lys He Leu Gin Leu Ala 
405 410 415 

Gly Gin Pro Phe Asp His Ser Pro He Arg Phe Cys Ala Arg Asn Gly 
420 425 430 

Glu Tyr Val Thr Met Asp Thr Ser Trp Ala Gly Phe Val His Pro Trp 
435 440 445 

Ser Arg Lys Val Ala Phe Val Leu Gly Arg His Lys Val Arg Thr Ala 
450 455 460 

Pro Leu Asn Glu Asp Val Phe Thr Pro Pro Ala Pro Ser Pro Ala Pro 
465 470 475 480 



Ser Leu Asp Thr Asp He Gin Glu Leu Ser Glu Gin He His Arg Leu 
485 490 495 



Leu Leu Gin Pro Val His Ser Pro Ser Pro Thr Gly Leu Cys Gly Val 
500 505 510 
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Gly Ala Val Thr Ser Pro Gly Pro Leu His Ser Pro Gly Ser Ser Ser 
515 520 525 

Asp Ser Asn Gly Gly Asp Ala Glu Gly Pro Gly Pro Pro Ala Pro Val 
530 535 540 

Thr Phe Gin Gin He Cys Lys Asp Val His Leu Val Lys His Gin Gly 
545 550 555 560 

Gin Gin Leu Phe He Glu Ser Arg Ala Arg Pro Gin Ser Arg Pro Arg 
565 570 575 

Leu Pro Ala Thr Gly Thr Phe Lys Ala Lys Ala Leu Pro Cys Gin Ser 
580 585 590 

Pro Asp Pro Glu Leu Glu Ala Gly Ser Ala Pro Val Gin Ala Pro Leu 
595 600 605 

Ala Leu Val Pro Glu Glu Ala Glu Arg Lys Glu Ala Ser Ser Cys Ser 
610 615 620 

Tyr Gin Gin He Asn Cys Leu Asp Ser He Leu Arg Tyr Leu Glu Ser 
625 630 635 640 

Cys Asn Leu Pro Ser Thr Thr Lys Arg Lys Cys Ala Ser Ser Ser Ser 
645 650 655 

Tyr Thr Thr Ser Ser Ala Ser Asp Asp Asp Arg Gin Arg Thr Gly Pro 
660 665 670 

Val Ser Val Gly Thr Lys Lys Asp Pro Pro Ser Ala Ala Leu Ser Gly 
675 680 685 



Glu Gly Ala Thr Pro Arg Lys Glu Pro Val Val Gly Gly Thr Leu Ser 
690 695 700 



Pro Leu Ala Leu Ala Asn Lys Ala Glu Ser Val Val Ser Val Thr Ser 
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705 



710 



715 



720 



Gin Cys Ser Phe Ser Ser Thr He Val His Val Gly Asp Lys Lys Pro 
725 730 735 



Pro Glu Ser Asp He He Met Met Glu Asp Leu Pro Gly Leu Ala Pro 
740 745 750 

Gly Pro Ala Pro Ser Pro Ala Pro Ser Pro Thr Val Ala Pro Asp Pro 
755 760 765 

Ala Pro Asp Ala Tyr Arg Pro Val Gly Leu Thr Lys Ala Val Leu Ser 
770 775 780 

Leu His Thr Gin Lys Glu Glu Gin Ala Phe Leu Ser Arg Phe Arg Asp 
785 790 795 800 

Leu Gly Arg Leu Arg Gly Leu Asp Ser Ser Ser Thr Ala Pro Ser Ala 
805 810 815 

Leu Gly Glu Arg Gly Cys His His Gly Pro Ala Pro Pro Ser Arg Arg 
820 825 830 

His His Cys Arg Ser Lys Ala Lys Arg Ser Arg His His Gin Asn Pro 
835 840 845 

Arg Ala Glu Ala Pro Cys Tyr Val Ser His Pro Ser Pro Val Pro Pro 
850 855 860 

Ser Thr Pro Trp Pro Thr Pro Pro Ala Thr Thr Pro Phe Pro Ala Val 
865 870 875 880 

Val Gin Pro Tyr Pro Leu Pro Val Phe Ser Pro Arg Gly Gly Pro Gin 
885 890 895 



Pro Leu Pro Pro Ala Pro Thr Ser Val Pro Pro Ala Ala Phe Pro Ala 
900 905 910 
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Pro Leu Val Thr Pro Met Val Ala Leu Val Leu Pro Asn Tyr Leu Phe 
915 920 925 

Pro Thr Pro Ser Ser Tyr Pro Tyr Gly Ala Leu Gin Thr Pro Ala Glu 
930 935 940 

Gly Pro Pro Thr Pro Ala Ser His Ser Pro Ser Pro Ser Leu Pro Ala 
945 950 955 960 

Leu Pro Pro Ser Pro Pro His Arg Pro Asp Ser Pro Leu Phe Asn Ser 
965 970 975 

Arg Cys Ser Ser Pro Leu Gin Leu Asn Leu Leu Gin Leu Glu Glu Leu 
980 985 990 

Pro Arg Ala Glu Gly Ala Ala Val Ala Gly Gly Pro Gly Ser Ser Ala 
995 1000 1005 

Gly Pro Pro Pro Pro Ser Ala Glu Ala Ala Glu Pro Glu Ala Arg Leu 
1010 1015 1020 

Ala Glu Val Thr Glu Ser Ser Asn Gin Asp Ala Leu Ser Gly Ser Ser 
1025 1030 1035 1040 

Asp Leu Leu Glu Leu Leu Leu Gin Glu Asp Ser Arg Ser Gly Thr Gly 
1045 1050 1055 

Ser Ala Ala Ser Gly Ser Leu Gly Ser Gly Leu Gly Ser Gly Ser Gly 
1060 1065 1070 

Ser Gly Ser His Glu Gly Gly Ser Thr Ser Ala Ser He Thr Arg Ser 
1075 1080 1085 

Ser Gin Ser Ser His Thr Ser Lys Tyr Phe Gly Ser He Asp Ser Ser 
1090 1095 1100 



Glu Ala Glu Ala Gly Ala Ala Arg Gly Gly Ala Glu Pro Gly Asp Gin 
1105 1110 1115 1120 
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Val lie Lys Tyr Val Leu Gin Asp Pro He Trp Leu Leu Met Ala Asn 
1125 1130 1135 

Ala Asp Gin Arg Val Met Met Thr Tyr Gin Val Pro Ser Arg Asp Met 
1140 1U5 1150 

Thr Ser Val Leu Lys Gin Asp Arg Glu Arg Leu Arg Ala Met Gin Lys 
1155 1160 1165 

Gin Gin Pro Arg Phe Ser Glu Asp Gin Arg Arg Glu Leu Gly Ala Val 
1170 H75 1180 

His Ser Trp Val Arg Lys Gly Gin Leu Pro Arg Ala Leu Asp Val Met 
H85 1190 1195 1200 

Ala Cys Val Asp Cys Gly Ser Ser Thr Gin Asp Pro Gly His Pro Asp 
1205 1210 1215 

Asp Pro Leu Phe Ser Glu Leu Asp Gly Leu Gly Leu Glu Pro Met Glu 
1220 1225 1230 

Glu Gly Gly Gly Glu Gin Gly Ser Ser Gly Gly Gly Ser Gly Glu Gly 
1235 1240 1245 

Glu Gly Cys Glu Glu Ala Gin Gly Gly Ala Lys Ala Ser Ser Ser Gin 
1250 1255 1260 

Asp Leu Ala Met Glu Glu Glu Glu Glu Gly Arg Ser Ser Ser Ser Pro 
1265 1270 1275 1280 

Ala Leu Pro Thr Ala Gly Asn Cys Thr Ser 
1285 1290 

<210> 2 
<211> 1291 
<212> PRT 

<213> Mus musculus 
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<400> 2 

Met Ser Gly Pro Leu Glu Gly Ala Asp Gly Gly Gly Asp Pro Arg Pro 
15 10 15 

Gly Glu Pro Phe Cys Pro Gly Gly Val Pro Ser Pro Gly Ala Pro Gin 
20 25 30 

His Arg Pro Cys Pro Gly Pro Ser Leu Ala Asp Asp Thr Asp Ala Asn 
35 40 45 

Ser Asn Gly Ser Ser Gly Asn Glu Ser Asn Gly Pro Glu Ser Arg Gly 
50 55 60 

Ala Ser Gin Arg Ser Ser His Ser Ser Ser Ser Gly Asn Gly Lys Asp 
65 70 75 80 

Ser Ala Leu Leu Glu Thr Thr Glu Ser Ser Lys Ser Thr Asn Ser Gin 
85 90 95 

Ser Pro Ser Pro Pro Ser Ser Ser He Ala Tyr Ser Leu Leu Ser Ala 
100 105 110 

Ser Ser Glu Gin Asp Asn Pro Ser Thr Ser Gly Cys Ser Ser Glu Gin 
115 120 125 

Ser Ala Arg Ala Arg Thr Gin Lys Glu Leu Met Thr Ala Leu Arg Glu 
130 135 140 

Leu Lys Leu Arg Leu Pro Pro Glu Arg Arg Gly Lys Gly Arg Ser Gly 
145 150 155 160 

Thr Leu Ala Thr Leu Gin Tyr Ala Leu Ala Cys Val Lys Gin Val Gin 
165 170 175 



Ala Asn Gin Glu Tyr Tyr Gin Gin Trp Ser Leu Glu Glu Gly Glu Pro 
180 185 190 
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Cys Ala Met Asp Met Ser Thr Tyr Thr Leu Glu Glu Leu Glu His He 
195 200 205 

Thr Ser Glu Tyr Thr Leu Arg Asn Gin Asp Thr Phe Ser Val Ala Val 
210 215 220 

Ser Phe Leu Thr Gly Arg He Val Tyr He Ser Glu Gin Ala Gly Val 
225 230 235 240 

Leu Leu Arg Cys Lys Arg Asp Val Phe Arg Gly Ala Arg Phe Ser Glu 
245 250 255 

Leu Leu Ala Pro Gin Asp Val Gly Val Phe Tyr Gly Ser Thr Thr Pro 
260 265 270 

Ser Arg Leu Pro Thr Trp Gly Thr Gly Thr Ser Ala Gly Ser Gly Leu 
275 280 285 

Lys Asp Phe Thr Gin Glu Lys Ser Val Phe Cys Arg He Arg Gly Gly 
290 295 300 

Pro Asp Arg Asp Pro Gly Pro Arg Tyr Gin Pro Phe Arg Leu Thr Pro 
305 310 315 320 

Tyr Val Thr Lys He Arg Val Ser Asp Gly Ala Pro Ala Gin Pro Cys 
325 330 335 

Cys Leu Leu He Ala Glu Arg He His Ser Gly Tyr Glu Ala Pro Arg 
340 345 350 

He Pro Pro Asp Lys Arg He Phe Thr Thr Arg His Thr Pro Ser Cys 
355 360 365 



Leu Phe Gin Asp Val Asp Glu Arg Ala Ala Pro Leu Leu Gly Tyr Leu 
370 375 380 



Pro Gin Asp Leu Leu Gly Ala Pro Val Leu Leu Phe Leu His Pro Glu 
385 390 395 400 
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Asp Arg Pro Leu Met Leu Ala He His Lys Lys He Leu Gin Leu Ala 
405 410 415 

Gly Gin Pro Phe Asp His Ser Pro He Arg Phe Cys Ala Arg Asn Gly 
420 425 430 

Glu Tyr Val Thr Met Asp Thr Ser Trp Ala Gly Phe Val His Pro Trp 
435 440 445 

Ser Arg Lys Val Ala Phe Val Leu Gly Arg His Lys Val Arg Thr Ala 
450 455 460 

Pro Leu Asn Glu Asp Val Phe Thr Pro Pro Ala Pro Ser Pro Ala Pro 
465 470 475 480 

Ser Leu Asp Ser Asp He Gin Glu Leu Ser Glu Gin lie His Arg Leu 
485 490 495 

Leu Leu Gin Pro Val His Ser Ser Ser Pro Thr Gly Leu Cys Gly Val 
500 505 510 

Gly Pro Leu Met Ser Pro Gly Pro Leu His Ser Pro Gly Ser Ser Ser 
515 520 525 

Asp Ser Asn Gly Gly Asp Ala Glu Gly Pro Gly Pro Pro Ala Pro Val 
530 535 540 

Thr Phe Gin Gin lie Cys Lys Asp Val His Leu Val Lys His Gin Gly 
545 550 555 560 

Gin Gin Leu Phe He Glu Ser Arg Ala Lys Pro Pro Pro Arg Pro Arg 
565 570 575 

Leu Leu Ala Thr Gly Thr Phe Lys Ala Lys Val Leu Pro Cys Gin Ser 
580 585 590 

Pro Asn Pro Glu Leu Glu Val Ala Pro Val Pro Asp Gin Ala Ser Leu 
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595 600 605 

Ala Leu Ala Pro Glu Glu Pro Glu Arg Lys Glu Thr Ser Gly Cys Ser 
610 615 620 

Tyr Gin Gin He Asn Cys Leu Asp Ser He Leu Arg Tyr Leu Glu Ser 
625 630 635 640 

Cys Asn He Pro Ser Thr Thr Lys Arg Lys Cys Ala Ser Ser Ser Ser 
645 650 655 

Tyr Thr Ala Ser Ser Ala Ser Asp Asp Asp Lys Gin Arg Ala Gly Pro 
660 665 670 

Val Pro Val Gly Ala Lys Lys Asp Pro Ser Ser Ala Met Leu Ser Gly 
675 680 685 

Glu Gly Ala Thr Pro Arg Lys Glu Pro Val Val Gly Gly Thr Leu Ser 
690 695 700 

Pro Leu Ala Leu Ala Asn Lys Ala Glu Ser Val Val Ser Val Thr Ser 
705 710 715 720 

Gin Cys Ser Phe Ser Ser Thr He Val His Val Gly Asp Lys Lys Pro 
725 730 735 

Pro Glu Ser Asp He He Met Met Glu Asp Leu Pro Gly Leu Ala Pro 
740 745 750 

Gly Pro Ala Pro Ser Pro Ala Pro Ser Pro Thr Val Ala Pro Asp Pro 
755 760 765 

Thr Pro Asp Ala Tyr Arg Pro Val Gly Leu Thr Lys Ala Val Leu Ser 
770 775 780 



Leu His Thr Gin Lys Glu Glu Gin Ala Phe Leu Asn Arg Phe Arg Asp 
785 790 795 800 
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Leu Gly Arg Leu Arg Gly Leu Asp Thr Ser Ser Val Ala Pro Ser Ala 
805 810 815 

Pro Gly Cys His His Gly Pro lie Pro Pro Gly Arg Arg His His Cys 
820 825 830 

Arg Ser Lys Ala Lys Arg Ser Arg His His His His Gin Thr Pro Arg 
835 840 845 

Pro Glu Thr Pro Cys Tyr Val Ser His Pro Ser Pro Val Pro Ser Ser 
850 855 860 

Gly Pro Trp Pro Pro Pro Pro Ala Thr Thr Pro Phe Pro Ala Met Val 
865 870 875 880 

Gin Pro Tyr Pro Leu Pro Val Phe Ser Pro Arg Gly Gly Pro Gin Pro 
885 890 895 

Leu Pro Pro Ala Pro Thr Ser Val Ser Pro Ala Thr Phe Pro Ser Pro 
900 905 910 

Leu Val Thr Pro Met Val Ala Leu Val Leu Pro Asn Tyr Leu Phe Pro 
915 920 925 

Thr Pro Pro Ser Tyr Pro Tyr Gly Val Ser Gin Ala Pro Val Glu Gly 
930 935 940 

Pro Pro Thr Pro Ala Ser His Ser Pro Ser Pro Ser Leu Pro Pro Pro 
945 950 955 960 

Pro Leu Ser Pro Pro His Arg Pro Asp Ser Pro Leu Phe Asn Ser Arg 
965 970 975 

Cys Ser Ser Pro Leu Gin Leu Asn Leu Leu Gin Leu Glu Glu Ser Pro 
980, 985 990 



Arg Thr Glu Gly Gly Ala Ala Ala Gly Gly Pro Gly Ser Ser Ala Gly 
995 1000 1005 
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Pro Leu Pro Pro Ser Glu Glu Thr Ala Glu Pro Glu Ala Arg Leu Val 
1010 1015 1020 

Glu Val Thr Glu Ser Ser Asn Gin Asp Ala Leu Ser Gly Ser Ser Asp 
1025 1030 1035 1040 

Leu Leu Glu Leu Leu Leu Gin Glu Asp Ser Arg Ser Gly Thr Gly Ser 
1045 1050 1055 

Ala Ala Ser Gly Ser Leu Gly Ser Gly Leu Gly Ser Gly Ser Gly Ser 
1060 1065 1070 

Gly Ser His Glu Gly Gly Ser Thr Ser Ala Ser He Thr Arg Ser Ser 
1075 1080 1085 

Gin Ser Ser His Thr Ser Lys Tyr Phe Gly Ser He Asp Ser Ser Glu 
1090 1095 1100 

Ala Glu Ala Gly Ala Ala Arg Ala Arg Thr Glu Pro Gly Asp Gin Val 
1105 1110 1115 1120 

lie Lys Cys Val Leu Gin Asp Pro He Trp Leu Leu Met Ala Asn Ala 
1125 1130 1135 

Asp Gin Arg Val Met Met Thr Tyr Gin Val Pro Ser Arg Asp Ala Ala 
1140 1145 1150 

Ser Val Leu Lys Gin Asp Arg Glu Arg Leu Arg Ala Met Gin Lys Gin 
1155 1160 1165 

Gin Pro Arg Phe Ser Glu Asp Gin Arg Arg Glu Leu Gly Ala Val His 
1170 1175 1180 

Ser Trp Val Arg Lys Gly Gin Leu Pro Arg Ala Leu Asp Val Met Ala 
1185 1190 1195 1200 



Cys Val Asp Cys Gly Ser Ser Val Gin Asp Pro Gly His Ser Asp Asp 
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1205 1210 1215 

Pro Leu Phe Ser Glu Leu Asp Gly Leu Gly Leu Glu Pro Met Glu Glu 
1220 1225 1230 

Gly Gly Gly Glu Gly Gly Gly Cys Gly Val Gly Gly Gly Gly Gly Asp 
1235 1240 1245 

Gly Gly Glu Glu Ala Gin Thr Gin He Gly Ala Lys Gly Ser Ser Ser 
1250 1255 1260 

Gin Asp Ser Ala Met Glu Glu Glu Glu Gin Gly Gly Gly Ser Ser Ser 
1265 1270 1275 1280 

Pro Ala Leu Pro Ala Glu Glu Asn Ser Thr Ser 
1285 1290 

<210> 3 

<211> 3873 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(3873) 

<400> 3 

atg agt ggc ccc eta gaa ggg get gat ggg gga ggg gac ccc agg cct 48 

Met Ser Gly Pro Leu Glu Gly Ala Asp Gly Gly Gly Asp Pro Arg Pro 
15 10 15 

ggg gaa tea ttt tgt cct ggg ggc gtc cca tec cct ggg ccc cca cag 96 
Gly Glu Ser Phe Cys Pro Gly Gly Val Pro Ser Pro Gly Pro Pro Gin 
20 25 30 

cac egg cct tgc cca ggc ccc age ctg gee gat gac acc gat gee aac 144 
His Arg Pro Cys Pro Gly Pro Ser Leu Ala Asp Asp Thr Asp Ala Asn 
35 40 45 
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age aat ggt tea agt ggc aat gag tec aac ggg cat gag tct aga ggc 192 
Ser Asn Gly Ser Ser Gly Asn Glu Ser Asn Gly His Glu Ser Arg Gly 
50 55 60 

gca tct cag egg age tea cac age tec tec tea ggc aac ggc aag gac 240 
Ala Ser Gin Arg Ser Ser His Ser Ser Ser Ser Gly Asn Gly Lys Asp 
65 70 75 80 

tea gee ctg ctg gag ace act gag age age aag age aca aac tct cag 288 
Ser Ala Leu Leu Glu Thr Thr Glu Ser Ser Lys Ser Thr Asn Ser Gin 
85 90 95 

age cca tec cca ccc age agt tec att gee tac age etc ctg agt gec 336 
Ser Pro Ser Pro Pro Ser Ser Ser He Ala Tyr Ser Leu Leu Ser Ala 
100 105 no 

age tea gag cag gac aac ccg tec acc agt ggc tgc age agt gaa cag 384 
Ser Ser Glu Gin Asp Asn Pro Ser Thr Ser Gly Cys Ser Ser Glu Gin 
115 120 125 

tea gee egg gca agg act cag aag gaa etc atg aca gca ctt cga gag 432 
Ser Ala Arg Ala Arg Thr Gin Lys Glu Leu Met Thr Ala Leu Arg Glu 
130 135 140 

etc aag ctt cga ctg ccg cca gag cgc egg ggc aag ggc cgc tct ggg 480 
Leu Lys Leu Arg Leu Pro Pro Glu Arg Arg Gly Lys Gly Arg Ser Gly 
145 150 155 160 

acc ctg gec acg ctg cag tac gca ctg gec tgt gtc aag cag gtg cag 528 
Thr Leu Ala Thr Leu Gin Tyr Ala Leu Ala Cys Val Lys Gin Val Gin 
165 170 175 

gee aac cag gaa tac tac cag cag tgg age ctg gag gag ggc gag cct 576 
Ala Asn Gin Glu Tyr Tyr Gin Gin Trp Ser Leu Glu Glu Gly Glu Pro 
180 185 190 

tgc tec atg gac atg tec acc tat acc ctg gag gag ctg gag cac ate 624 ^ 
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Cys Ser Met Asp Met Ser Thr Tyr Thr Leu Glu Glu Leu Glu His He 
195 200 205 

acg tct gag tac aca ctt cag aac cag gat acc ttc tea gtg get gtc 672 
Thr Ser Glu Tyr Thr Leu Gin Asn Gin Asp Thr Phe Ser Val Ala Val 
210 215 220 

tec ttc ctg acg ggc cga ate gtc tac att teg gag cag gca gee gtc 720 
Ser Phe Leu Thr Gly Arg He Val Tyr He Ser Glu Gin Ala Ala Val 
225 230 235 240 

ctg ctg cgt tgc aag egg gac gtg ttc egg ggt acc cgc ttc tct gag 768 
Leu Leu Arg Cys Lys Arg Asp Val Phe Arg Gly Thr Arg Phe Ser Glu 
245 250 255 

etc ctg get ccc cag gat gtg gga gtc ttc tat ggt tec act get cca 816 
Leu Leu Ala Pro Gin Asp Val Gly Val Phe Tyr Gly Ser Thr Ala Pro 
260 265 270 

tct cgc ctg ccc acc tgg ggc aca ggg gec tea gca ggt tea ggc etc 864 
Ser Arg Leu Pro Thr Trp Gly Thr Gly Ala Ser Ala Gly Ser Gly Leu 
275 280 285 

agg gac ttt acc cag gag aag tec gtc ttc tgc cgt ate aga gga ggt 912 
Arg Asp Phe Thr Gin Glu Lys Ser Val Phe Cys Arg He Arg Gly Gly 
290 295 300 

cct gac egg gat cca ggg cct egg tac cag cca ttc cgc eta acc ccg 960 
Pro Asp Arg Asp Pro Gly Pro Arg Tyr Gin Pro Phe Arg Leu Thr Pro 
305 310 315 320 

tat gtg acc aag ate egg gtc tea gat ggg gee cct gca cag ccg tgc 1008 
Tyr Val Thr Lys He Arg Val Ser Asp Gly Ala Pro Ala Gin Pro Cys 
325 330 335 

tgc ctg ctg att gca gag cgc ate cat teg ggt tac gaa get ccc egg 1056 
Cys Leu Leu He Ala Glu Arg lie His Ser Gly Tyr Glu Ala Pro Arg 
340 345 350 
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ata ccc cct gac aag agg att ttc act acg egg cac aca ccc age tgc 1104 
He Pro Pro Asp Lys Arg He Phe Thr Thr Arg His Thr Pro Ser Cys 
355 360 365 

etc ttc cag gat gtg gat gaa agg get gec ccc ctg ctg ggc tac ctg 1152 
Leu Phe Gin Asp Val Asp Glu Arg Ala Ala Pro Leu Leu Gly Tyr Leu 
370 375 380 

ccc cag gac etc ctg ggg gee cca gtg etc ctg ttc ctg cat cct gag 1200 
Pro Gin Asp Leu Leu Gly Ala Pro Val Leu Leu Phe Leu His Pro Glu 
385 390 395 400 

gac cga ccc etc atg ctg get ate cac aag aag att ctg cag ttg gcg 1248 
Asp Arg Pro Leu Met Leu Ala lie His Lys Lys lie Leu Gin Leu Ala 
405 410 415 

ggc cag ccc ttt gac cac tec cct ate cgc ttc tgt gee cgc aac ggg 1296 
Gly Gin Pro Phe Asp His Ser Pro He Arg Phe Cys Ala Arg Asn Gly 
420 425 430 

gag tat gtc acc atg gac acc age tgg get ggc ttt gtg cac ccc tgg 1344 
Glu Tyr Val Thr Met Asp Thr Ser Trp Ala Gly Phe Val His Pro Trp 
435 440 445 

age cgc aag gta gec ttc gtg ttg ggc cgc cac aaa gta cgc acg gee 1392 
Ser Arg Lys Val Ala Phe Val Leu Gly Arg His Lys Val Arg Thr Ala 
450 455 460 

ccc ctg aat gag gac gtg ttc act ccc ccg gee ccc age cca get ccc 1440 
Pro Leu Asn Glu Asp Val Phe Thr Pro Pro Ala Pro Ser Pro Ala Pro 
465 470 475 480 

tec ctg gac act gat ate cag gag ctg tea gag cag ate cac egg ctg 1488 
Ser Leu Asp Thr Asp He Gin Glu Leu Ser Glu Gin He His Arg Leu 
485 490 495 

ctg ctg cag ccc gtc cac age ccc age ccc acg gga etc tgt gga gtc 1536 ' 
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Leu Leu Gin Pro Val His Ser Pro Ser Pro Thr Gly Leu Cys Gly Val 
500 505 510 

ggc gcc gtg aca tec cca ggc cct etc cac age cct ggg tec tec agt 1584 
Gly Ala Val Thr Ser Pro Gly Pro Leu His Ser Pro Gly Ser Ser Ser 
515 520 525 

gat age aac ggg ggt gat gca gag ggg cct ggg cct cct gcg cca gtg 1632 
Asp Ser Asn Gly Gly Asp Ala Glu Gly Pro Gly Pro Pro Ala Pro Val 
530 535 540 

act ttc cag cag ate tgt aag gat gtg cat ctg gtg aag cac cag ggc 1680 
Thr Phe Gin Gin He Cys Lys Asp Val His Leu Val Lys His Gin Gly 
545 550 555 560 

cag cag ctt ttt att gag tct egg gcc egg cct cag tec egg ccc cgc 1728 
Gin Gin Leu Phe He Glu Ser Arg Ala Arg Pro Gin Ser Arg Pro Arg 
565 570 575 

etc cct get aca ggc acg ttc aag gcc aag gcc ctt ccc tgc caa tec 1776 
Leu Pro Ala Thr Gly Thr Phe Lys Ala Lys Ala Leu Pro Cys Gin Ser 
580 585 590 

cca gac cca gag ctg gag gcg ggt tct get ccc gtc cag gcc cca eta 1824 
Pro Asp Pro Glu Leu Glu Ala Gly Ser Ala Pro Val Gin Ala Pro Leu 
595 600 605 

gcc ttg gtc cct gag gag gcc gag agg aaa gaa gcc tec age tgc tec 1872 
Ala Leu Val Pro Glu Glu Ala Glu Arg Lys Glu Ala Ser Ser Cys Ser 
610 615 620 

tac cag cag ate aac tgc ctg gac age ate etc agg tac ctg gag age 1920 
Tyr Gin Gin He Asn Cys Leu Asp Ser He Leu Arg Tyr Leu Glu Ser 
625 630 635 640 

tgc aac etc ccc age acc act aag cgt aaa tgt gcc tec tec tec tec 1968 
Cys Asn Leu Pro Ser Thr Thr Lys Arg Lys Cys Ala Ser Ser Ser Ser 
645 650 ' 655 
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tat acc acc tec tea gee tct gac gac gac agg cag agg aca ggt cca 2016 
Tyr Thr Thr Ser Ser Ala Ser Asp Asp Asp Arg Gin Arg Thr Gly Pro 
660 665 670 

gtc tct gtg ggg acc aag aaa gat ccg ccg tea gca gcg ctg tct ggg 2064 
Val Ser Val Gly Thr Lys Lys Asp Pro Pro Ser Ala Ala Leu Ser Gly 
675 680 685 

gag ggg gec acc cca egg aag gag cca gtg gtg gga ggc acc ctg age 2112 
Glu Gly Ala Thr Pro Arg Lys Glu Pro Val Val Gly Gly Thr Leu Ser 
690 695 700 

ccg etc gec ctg gee aat aag gcg gag agt gtg gtg tec gtc acc agt 2160 
Pro Leu Ala Leu Ala Asn Lys Ala Glu Ser Val Val Ser Val Thr Ser 
705 710 715 720 

cag tgt age ttc age tec acc ate gtc cat gtg gga gac aag aag ccc 2208 
Gin Cys Ser Phe Ser Ser Thr He Val His Val Gly Asp Lys Lys Pro 
725 730 735 

ccg gag teg gac ate ate atg atg gag gac ctg cct ggc eta gec cca 2256 
Pro Glu Ser Asp He He Met Met Glu Asp Leu Pro Gly Leu Ala Pro 
740 745 750 

ggc cca gee ccc age cca gec ccc age ccc aca gta gec cct gac cca 2304 
Gly Pro Ala Pro Ser Pro Ala Pro Ser Pro Thr Val Ala Pro Asp Pro 
755 760 765 

gec cca gac gee tac cgt cca gtg ggg ctg acc aag gec gtg ctg tec 2352 
Ala Pro Asp Ala Tyr Arg Pro Val Gly Leu Thr Lys Ala Val Leu Ser 
770 775 780 

ctg cac aca cag aag gaa gag caa gec ttc etc age cgc ttc cga gac 2400 
Leu His Thr Gin Lys Glu Glu Gin Ala Phe Leu Ser Arg Phe Arg Asp 
785 790 795 800 

ctg ggc agg ctg cgt gga etc gac age tct tec aca get ccc tea gec 2448 ' 
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Leu Gly Arg Leu Arg Gly Leu Asp Ser Ser Ser Thr Ala Pro Ser Ala 
805 810 815 

ctt ggc gag cga ggc tgc cac cac ggc ccc gca ccc cca age cgc cga 2496 
Leu Gly Glu Arg Gly Cys His His Gly Pro Ala Pro Pro Ser Arg Arg 
820 825 830 

cac cac tgc cga tec aaa gec aag cgc tea cgc cac cac cag aac cct 2544 
His His Cys Arg Ser Lys Ala Lys Arg Ser Arg His His Gin Asn Pro 
835 840 845 

egg get gaa gcg ccc tgc tat gtc tea cac ccc tea ccc gtg cca ccc 2592 
Arg Ala Glu Ala Pro Cys Tyr Val Ser His Pro Ser Pro Val Pro Pro 
850 855 860 

tec acc ccc tgg ccc acc cca cca gec act acc ccc ttc cca gcg gtt 2640, 
Ser Thr Pro Trp Pro Thr Pro Pro Ala Thr Thr Pro Phe Pro Ala Val 
865 870 875 880 

gtc cag ccc tac cct etc cca gtg ttc tct cct cga gga ggc ccc cag 2688 
Val Gin Pro Tyr Pro Leu Pro Val Phe Ser Pro Arg Gly Gly Pro Gin 
885 890 895 

cct ctt ccc cct get ccc aca tct gtg ccc cca get get ttc ccc gec 2736 
Pro Leu Pro Pro Ala Pro Thr Ser Val Pro Pro Ala Ala Phe Pro Ala 
900 905 910 

cct ttg gtg acc cca atg gtg gee ttg gtg etc cct aac tat ctg ttc 2784 
Pro Leu Val Thr Pro Met Val Ala Leu Val Leu Pro Asn Tyr Leu Phe 
915 920 925 

cca acc cca tec age tat cct tat ggg gca etc cag acc cct get gaa 2832 
Pro Thr Pro Ser Ser Tyr Pro Tyr Gly Ala Leu Gin Thr Pro Ala Glu 
930 935 940 

ggg cct ccc act cct gee teg cac tec cct tct cca tec ttg ccc gee 2880 
Gly Pro Pro Thr Pro Ala Ser His Ser Pro Ser Pro Ser Leu Pro Ala 
945 950 955 960 
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etc ccc ccg agt cct cct cac cgc ccg gac tct cca ctg ttc aac teg 2928 
Leu Pro Pro Ser Pro Pro His Arg Pro Asp Ser Pro Leu Phe Asn Ser 
965 970 975 

aga tgc age tct cca etc cag etc aat ctg ctg cag ctg gag gag etc 2976 
Arg Cys Ser Ser Pro Leu Gin Leu Asn Leu Leu Gin Leu Glu Glu Leu 
980 985 990 

ccc cgt get gag ggg get get gtt gca gga ggc cct ggg age agt gec 3024 
Pro Arg Ala Glu Gly Ala Ala Val Ala Gly Gly Pro Gly Ser Ser Ala 
995 1000 1005 

ggg ccc cca cct ccc agt gcg gag get get gag cca gag gee aga ctg 3072 
Gly Pro Pro Pro Pro Ser Ala Glu Ala Ala Glu Pro Glu Ala Arg Leu 
1010 1015 1020 

gcg gag gtc act gag tec tec aat cag gac gca ctt tec ggc tec agt 3120 
Ala Glu Val Thr Glu Ser Ser Asn Gin Asp Ala Leu Ser Gly Ser Ser 
1025 1030 1035 1040 

gac ctg etc gaa ctt ctg ctg caa gag gac teg cgc tec ggc aca ggc 3168 
Asp Leu Leu Glu Leu Leu Leu Gin Glu Asp Ser Arg Ser Gly Thr Gly 
1045 1050 1055 

tec gca gee teg ggc tec ttg ggc tct ggc ttg ggc tct ggg tct ggt 3216 
Ser Ala Ala Ser Gly Ser Leu Gly Ser Gly Leu Gly Ser Gly Ser Gly 
1060 1065 1070 

tea ggc tec cat gaa ggg ggc age acc tea gee age ate act cgc age 3264 
Ser Gly Ser His Glu Gly Gly Ser Thr Ser Ala Ser He Thr Arg Ser 
1075 1080 1085 

age cag age age cac aca age aaa tac ttt ggc age ate gac tct tec 3312 
Ser Gin Ser Ser His Thr Ser Lys Tyr Phe Gly Ser He Asp Ser Ser 
1090 1095 1100 

gag get gag get ggg get get egg ggc ggg get gag cct ggg gac cag 3360 
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Glu Ala Glu Ala Gly Ala Ala Arg Gly Gly Ala Glu Pro Gly Asp Gin 
1105 1110 1H5 1120 

gtg att aag tac gtg etc cag gat ccc att tgg ctg etc atg gec aat 3408 
Val He Lys Tyr Val Leu Gin Asp Pro He Trp Leu Leu Met Ala Asn 
1125 1130 1135 

get gac cag cgc gtc atg atg acc tac cag gtg ccc tec agg gac atg 3456 
Ala Asp Gin Arg Val Met Met Thr Tyr Gin Val Pro Ser Arg Asp Met 
1140 1145 1150 

acc tct gtg ctg aag cag gat egg gag egg etc cga gee atg cag aag 3504 
Thr Ser Val Leu Lys Gin Asp Arg Glu Arg Leu Arg Ala Met Gin Lys 
1155 1160 1165 

cag cag cct egg ttt tct gag gac cag egg egg gaa ctg ggt get gtg 3552 
Gin Gin Pro Arg Phe Ser Glu Asp Gin Arg Arg Glu Leu Gly Ala Val 
1170 1175 1180 

cac tec tgg gtc egg aag ggc caa ctg cct egg get ctt gat gtg atg 3600 
His Ser Trp Val Arg Lys Gly Gin Leu Pro Arg Ala Leu Asp Val Met 
1185 1190 1195 1200 

gec tgt gtg gac tgt ggg age age acc caa gat cct ggt cac cct gat 3648 
Ala Cys Val Asp Cys Gly Ser Ser Thr Gin Asp Pro Gly His Pro Asp 
1205 1210 1215 

gac cca etc ttc tea gag ctg gat gga ctg ggg ctg gag ccc atg gaa 3696 
Asp Pro Leu Phe Ser Glu Leu Asp Gly Leu Gly Leu Glu Pro Met Glu 
1220 1225 1230 

gag ggt gga ggc gag cag ggc age age ggt ggc ggc agt ggt gag gga 3744 
Glu Gly Gly Gly Glu Gin Gly Ser Ser Gly Gly Gly Ser Gly Glu Gly 
1235 1240 1245 

gag ggc tgc gag gag gec caa ggc ggg gee aag get tea age tct cag 3792 
Glu Gly Cys Glu Glu Ala Gin Gly Gly Ala Lys Ala Ser Ser Ser Gin 
1250 1255 1260 
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gac ttg get atg gag gag gag gaa gaa ggc agg age tea tec agt cca 3840 
Asp Leu Ala Met Glu Glu Glu Glu Glu Gly Arg Ser Ser Ser Ser Pro 
^65 1270 1275 1280 

gee tta cct aca gca gga aac tgc acc age tag 3873 
Ala Leu Pro Thr Ala Gly Asn Cys Thr Ser 
1285 1290 

<210> 4 
<211> 3876 
<212> DNA 

<213> Mus musculus 

<220> 
<221> CDS 
<222> (1)..(3876) 

<400> 4 

atg agt ggt ccc eta gaa ggg gee gat ggg gga gga gac ccc agg ccc 48 
Met Ser Gly Pro Leu Glu Gly Ala Asp Gly Gly Gly Asp Pro Arg Pro 
15 10 15 

gga gaa cct ttt tgt cct gga gga gtc cca tec cct ggg gec ccg cag 96 
Gly Glu Pro Phe Cys Pro Gly Gly Val Pro Ser Pro Gly Ala Pro Gin 
20 25 30 

cac egg cct tgt cca ggc ccc age ctg get gat gac act gat gca aac 144 
His Arg Pro Cys Pro Gly Pro Ser Leu Ala Asp Asp Thr Asp Ala Asn 
35 40 45 

age aat ggc tea agt ggc aat gag tec aac gga ccc gag tec agg ggc 192 
Ser Asn Gly Ser Ser Gly Asn Glu Ser Asn Gly Pro Glu Ser Arg Gly 
50 55 60 

gca tct cag egg agt tct cat agt tec tct tct ggc aat ggc aag gac 240 
Ala Ser Gin Arg Ser Ser His Ser Ser Ser Ser Gly Asn Gly Lys Asp 
65 70 75 80 • r 



24/31 

SPO-108 



tea get ctg ctg gag acc act gag age age aag agt aca aac tea cag 288 
Ser Ala Leu Leu Glu Thr Thr Glu Ser Ser Lys Ser Thr Asn Ser Gin 
85 90 95 

age cca tec cca ccc age age tec att gee tac age etc ctg agt gcg 336 
Ser Pro Ser Pro Pro Ser Ser Ser He Ala Tyr Ser Leu Leu Ser Ala 
100 105 110 

age tea gag cag gac aac cca tct acc agt ggc tgc age agt gaa cag 384 
Ser Ser Glu Gin Asp Asn Pro Ser Thr Ser Gly Cys Ser Ser Glu Gin 
115 120 125 

tea get cga gee agg acc cag aaa gaa etc atg act gca ctt egg gag 432 
Ser Ala Arg Ala Arg Thr Gin Lys Glu Leu Met Thr Ala Leu Arg Glu 
130 135 140 

etc aaa ctt cga ctg cca cca gag cgt egg ggc aag ggc cgc tct ggg 480 
Leu Lys Leu Arg Leu Pro Pro Glu Arg Arg Gly Lys Gly Arg Ser Gly 
145 150 155 160 

acc ttg gee aca ctg cag tac get ctg gee tgt gtc aag cag gtt cag 528 
Thr Leu Ala Thr Leu Gin Tyr Ala Leu Ala Cys Val Lys Gin Val Gin 
165 170 175 

get aac cag gaa tat tac cag cag tgg agt ctg gag gag ggt gag cct 576 
Ala Asn Gin Glu Tyr Tyr Gin Gin Trp Ser Leu Glu Glu Gly Glu Pro 
180 185 190 

tgt gec atg gac atg tct act tac acc ctg gag gaa ttg gag cat ate 624 
Cys Ala Met Asp Met Ser Thr Tyr Thr Leu Glu Glu Leu Glu His He 
195 200 205 

aca tec gaa tac aca ctt cga aac cag gac acc ttc tct gtg get gtg 672 
Thr Ser Glu Tyr Thr Leu Arg Asn Gin Asp Thr Phe Ser Val Ala Val 
210 215 220 

tec ttc ctg aca ggc egg att gtc tat att teg gag cag gca ggt gtc 720 
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Ser Phe Leu Thr Gly Arg He Val Tyr He Ser Glu Gin Ala Gly Val 
225 230 235 240 

ctg ctg cgt tgc aaa egg gat gtg ttt egg ggt gec cgc ttc tea gag 768 
Leu Leu Arg Cys Lys Arg Asp Val Phe Arg Gly Ala Arg Phe Ser Glu 
245 250 255 

etc ctg get ccc cag gat gtg ggt gtc ttc tat ggc tct act aca cca 816 
Leu Leu Ala Pro Gin Asp Val Gly Val Phe Tyr Gly Ser Thr Thr Pro 
260 265 270 

tct cga ctg ccc acc tgg ggc act ggc acc tct gca ggt tea ggt etc 864 
Ser Arg Leu Pro Thr Trp Gly Thr Gly Thr Ser Ala Gly Ser Gly Leu 
275 280 285 

aag gac ttc acc cag gaa aag tct gtc ttc tgc cga ate aga gga ggt 912 
Lys Asp Phe Thr Gin Glu Lys Ser Val Phe Cys Arg He Arg Gly Gly 
290 295 300 

cct gac egg gat cca ggg cct egg tac cag cca ttc cgc eta acc cca 960 
Pro Asp Arg Asp Pro Gly Pro Arg Tyr Gin Pro Phe Arg Leu Thr Pro 
305 310 315 320 

tat gtg acc aag att egg gtc tea gat gga gee cct gca cag ccg tgc 1008 
Tyr Val Thr Lys He Arg Val Ser Asp Gly Ala Pro Ala Gin Pro Cys 
325 330 335 

tgc eta etc att gee gag cgc ate cac tct ggt tat gaa get ccc egg 1056 
Cys Leu Leu He Ala Glu Arg He His Ser Gly Tyr Glu Ala Pro Arg 
340 345 350 

ate cct cct gac aag agg ate ttc acc acc cga cac aca cca age tgc 1104 
He Pro Pro Asp Lys Arg He Phe Thr Thr Arg His Thr Pro Ser Cys 
355 360 365 

etc ttc cag gat gta gat gaa agg get gee cca ctg ctg ggt tac ctt 1152 
Leu Phe Gin Asp Val Asp Glu Arg Ala Ala Pro Leu Leu Gly Tyr Leu 
370 375 380 
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ccc cag gat etc ctg ggg get cca gta ctt etc ttt eta cat cct gag 1200 
Pro Gin Asp Leu Leu Gly Ala Pro Val Leu Leu Phe Leu His Pro Glu 
385 390 395 400 

gac cga ccc etc atg ctg gee att cat aag aag ata ctg cag ctg gca 1248 
Asp Arg Pro Leu Met Leu Ala He His Lys Lys He Leu Gin Leu Ala 
405 410 415 

ggc cag ccc ttt gac cat tec cct att cgc ttc tgt get egg aac ggg 1296 
Gly Gin Pro Phe Asp His Ser Pro He Arg Phe Cys Ala Arg Asn Gly 
420 425 430 

gaa tat gtc acc atg gac ace age tgg gee ggt ttt gtg cac ccc tgg 1344 
Glu Tyr Val Thr Met Asp Thr Ser Trp Ala Gly Phe Val His Pro Trp 
435 440 445 

age cgc aag gtg get ttc gtg ttg ggt cgc cat aaa gtg cgc acg gca 1392 
Ser Arg Lys Val Ala Phe Val Leu Gly Arg His Lys Val Arg Thr Ala 
450 455 460 

ccc ctg aat gag gac gtc ttc act ccc cca gee ccc age cca get ccg 1440 
Pro Leu Asn Glu Asp Val Phe Thr Pro Pro Ala Pro Ser Pro Ala Pro 
465 470 475 480 

tec ctg gac tct gat ate cag gag etc tea gag cag ate cat cga ttg 1488 
Ser Leu Asp Ser Asp He Gin Glu Leu Ser Glu Gin He His Arg Leu 
485 490 495 

ctg ctg cag cct gtg cac age tec age ccc acg ggg etc tgt gga gtt 1536 
Leu Leu Gin Pro Val His Ser Ser Ser Pro Thr Gly Leu Cys Gly Val 
500 505 510 

ggc cct ctg atg tec cct ggt cct eta cac age cct ggc tec tec agt 1584 
Gly Pro Leu Met Ser Pro Gly Pro Leu His Ser Pro Gly Ser Ser Ser 
515 520 525 

gat age aat ggg ggg gac get gag ggg cct ggg cct cct get cca gtg 1632 ; 
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Asp Ser Asn Gly Gly Asp Ala Glu Gly Pro Gly Pro Pro Ala Pro Val 
530 535 540 

act ttc cag cag ate tgt aag gat gtg cat ctg gta aag cac cag gga 1680 
Thr Phe Gin Gin He Cys Lys Asp Val His Leu Val Lys His Gin Gly 
545 550 555 560 

caa cag etc ttc att gaa tct egg gec aag ccc cca ccc egg ccc cgc 1728 
Gin Gin Leu Phe lie Glu Ser Arg Ala Lys Pro Pro Pro Arg Pro Arg 
565 570 575 

etc ctt get aca ggt aca ttc aaa gee aaa gtc ctt ccc tgc cag tec 1776 
Leu Leu Ala Thr Gly Thr Phe Lys Ala Lys Val Leu Pro Cys Gin Ser 
580 585 590 

cca aac ccc gaa ctg gag gtg gee cca gtt cct gac caa gec teg tta 1824 
Pro Asn Pro Glu Leu Glu Val Ala Pro Val Pro Asp Gin Ala Ser Leu 
595 600 605 

gec ttg gec cct gag gag cca gag agg aaa gaa ace tct ggc tgt tec 1872 
Ala Leu Ala Pro Glu Glu Pro Glu Arg Lys Glu Thr Ser Gly Cys Ser 
610 615 620 

tac cag cag ate aac tgc ctg gac age ate etc agg tat ttg gag age 1920 
Tyr Gin Gin He Asn Cys Leu Asp Ser He Leu Arg Tyr Leu Glu Ser 
625 630 635 640 

tgc aac att ccc agt aca acc aag cgt aaa tgt gee tec tec tec tec 1968 
Cys Asn lie Pro Ser Thr Thr Lys Arg Lys Cys Ala Ser Ser Ser Ser 
645 650 655 

tac act gec tct tea gee tct gat gat gac aag cag agg gca ggt cca 2016 
Tyr Thr Ala Ser Ser Ala Ser Asp Asp Asp Lys Gin Arg Ala Gly Pro 
660 665 670 

gtt cct gtg ggg gec aag aaa gat ccg teg tea gca atg ctg tct ggg 2064 
Val Pro Val Gly Ala Lys Lys Asp Pro Ser Ser Ala Met Leu Ser Gly 
675 680 685 
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gag ggg gca act cct egg aag gag cca gtg gtg gga ggc acc ctg age 2112 
Glu Gly Ala Thr Pro Arg Lys Glu Pro Val Val Gly Gly Thr Leu Ser 
690 695 700 

ccg etc gee ctg gee aat aag gca gag age gtg gtg tec gtc acc agt 2160 
Pro Leu Ala Leu Ala Asn Lys Ala Glu Ser Val Val Ser Val Thr Ser 
705 710 715 720 

cag tgt age ttc age tec acc ate gtc cat gtg gga gac aag aag ccc 2208 
Gin Cys Ser Phe Ser Ser Thr He Val His Val Gly Asp Lys Lys Pro 
725 730 735 

ccg gag teg gac ate ate atg atg gaa gac ctg cct ggc ctg gee cct 2256 
Pro Glu Ser Asp He He Met Met Glu Asp Leu Pro Gly Leu Ala Pro 
740 745 750 

ggc cca gee ccc agt ccg gee ccc age ccc aca gta gee cct gac cca 2304 
Gly Pro Ala Pro Ser Pro Ala Pro Ser Pro Thr Val Ala Pro Asp Pro 
755 760 765 

acc cca gat get tat cgc cca gtg ggt ctg acc aag gee gtg ctg tec 2352 
Thr Pro Asp Ala Tyr Arg Pro Val Gly Leu Thr Lys Ala Val Leu Ser 
770 775 780 

ctg cac aca cag aag gaa gag caa gee ttc etc aac cgc ttc aga gat 2400 
Leu His Thr Gin Lys Glu Glu Gin Ala Phe Leu Asn Arg Phe Arg Asp 
785 790 795 800 

ctt ggc agg ctt cgt gga ctt gac acc tct tct gtg gec ccc tea gec 2448 
Leu Gly Arg Leu Arg Gly Leu Asp Thr Ser Ser Val Ala Pro Ser Ala 
805 810 815 

cct ggc tgc cac cat ggc ccc att ccc cct ggt cgc cga cac cac tgc 2496 
Pro Gly Cys His His Gly Pro He Pro Pro Gly Arg Arg His His Cys 
820 825 830 

cga tct aaa gca aag cgt tec cgc cac cac cac cac cag acc ccc egg 2544 r 
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Arg Ser Lys Ala Lys Arg Ser Arg His His His His Gin Thr Pro Arg 
835 840 845 

ccc gaa act ccc tgc tat gtc tec cat cct tea cct gtg ccc tct tct 2592 
Pro Glu Thr Pro Cys Tyr Val Ser His Pro Ser Pro Val Pro Ser Ser 
850 855 860 

gga ccc tgg cca ccc cca cca gec acg acc ccc ttc cca gca atg gtc 2640 
Gly Pro Trp Pro Pro Pro Pro Ala Thr Thr Pro Phe Pro Ala Met Val 
865 870 875 880 

cag ccc tac cca etc cca gta ttc tec cct cga gga gga ccc cag ccc 2688 
Gin Pro Tyr Pro Leu Pro Val Phe Ser Pro Arg Gly Gly Pro Gin Pro 
885 890 895 

ctt ccc cct gec cct aca tct gtg tec cct get acc ttc cct tct ccc 2736 
Leu Pro Pro Ala Pro Thr Ser Val Ser Pro Ala Thr Phe Pro Ser Pro 
900 905 910 

tta gtg acc cca atg gtg gec ttg gtg etc cct aac tat eta ttc cct 2784 
Leu Val Thr Pro Met Val Ala Leu Val Leu Pro Asn Tyr Leu Phe Pro 
915 920 925 

acc cca cct agt tat cca tat ggg gtg tec cag gec cct gtt gag ggg 2832 
Thr Pro Pro Ser Tyr Pro Tyr Gly Val Ser Gin Ala Pro Val Glu Gly 
930 935 940 

cca ccc acg cct get tec cac teg ccc tct cca tec ctg ccc cca cca 2880 
Pro Pro Thr Pro Ala Ser His Ser Pro Ser Pro Ser Leu Pro Pro Pro 
945 950 955 960 

cct etc age ccc ccc cac cgc cca gac tec cca ctg ttc aac teg aga 2928 
Pro Leu Ser Pro Pro His Arg Pro Asp Ser Pro Leu Phe Asn Ser Arg 
965 970 975 

tgc age tec cca etc cag etc aat ctg ctg cag ctt gag gag tec ccc 2976 
Cys Ser Ser Pro Leu Gin Leu Asn Leu Leu Gin Leu Glu Glu Ser Pro 
980 985 990 
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cgc acg gag ggg ggc get get gca gga ggc cca gga age agt get ggg 
Arg Thr Glu Gly Gly Ala Ala Ala Gly Gly Pro Gly Ser Ser Ala Gly 
995 1000 . 1005 



3024 



ccc ctg cct ccc agt gag gag act get gag cca gag gec aga ttg gtg 
Pro Leu Pro Pro Ser Glu Glu Thr Ala Glu Pro Glu Ala Arg Leu Val 
1010 1015 1020 



3072 



gag gtt act gag teg tec aat cag gat gca ctt tea ggc tec age gac 
Glu Val Thr Glu Ser Ser Asn Gin Asp Ala Leu Ser Gly Ser Ser Asp 
1025 1030 1035 1040 



3120 



ctg ctg gag eta ctg etc caa gaa gac tct cgc teg ggc aca ggc tec 
Leu Leu Glu Leu Leu Leu Gin Glu Asp Ser Arg Ser Gly Thr Gly Ser 
1045 1050 1055 



3168 



gca gec tea ggc tec ctg ggc tct ggc ctg ggc tct ggg tct ggt tea 
Ala Ala Ser Gly Ser Leu Gly Ser Gly Leu Gly Ser Gly Ser Gly Ser 
1060 1065 1070 



3216 



gga tec cac gaa ggg gga age ace tea gec age ate acc cgc age agt 
Gly Ser His Glu Gly Gly Ser Thr Ser Ala Ser He Thr Arg Ser Ser 
1075 1080 1085 



3264 



cag age age cat aca age aag tac ttt ggc age ate gac tct tec gag 
Gin Ser Ser His Thr Ser Lys Tyr Phe Gly Ser He Asp Ser Ser Glu 
1090 1095 1100 



3312 



get gaa get ggg get get egg gec agg act gag cct ggg gac cag gtc 
Ala Glu Ala Gly Ala Ala Arg Ala Arg Thr Glu Pro Gly Asp Gin Val 
1105 1110 1115 1120 



3360 



att aag tgt gtg etc cag gac ccc ate tgg ctg etc atg gec aat gec 3408 
He Lys Cys Val Leu Gin Asp Pro lie Trp Leu Leu Met Ala Asn Ala 
1125 1130 1135 

gac cag cgt gtc atg atg aca tac cag gtg ccg tec agg gat gca gec 3456. 
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Asp Gin Arg Val Met Met Thr Tyr Gin Val Pro Ser Arg Asp Ala Ala 
1140 1145 1150 

tct gtg ctg aag caa gac egg gag agg etc egg gee atg cag aaa cag 3504 
Ser Val Leu Lys Gin Asp Arg Glu Arg Leu Arg Ala Met Gin Lys Gin 
1155 1160 1165 

cag cca egg ttc tea gag gac cag agg egg gaa ctg ggt get gtg cac 3552 
Gin Pro Arg Phe Ser Glu Asp Gin Arg Arg Glu Leu Gly Ala Val His 
1170 1175 1180 

tec tgg gtc egg aag ggc cag ctg cct egg gee ctt gat gtg atg gcg 3600 
Ser Trp Val Arg Lys Gly Gin Leu Pro Arg Ala Leu Asp Val Met Ala 
1185 1190 1195 1200 

tgt gtg gac tgt ggc age age gtt caa gat cct ggc cac tct gat gac 3648 
Cys Val Asp Cys Gly Ser Ser Val Gin Asp Pro Gly His Ser Asp Asp 
1205 1210 1215 

ccg etc ttc tea gaa ctg gat gga ttg ggg ctg gag ccc atg gaa gag 3696 
Pro Leu Phe Ser Glu Leu Asp Gly Leu Gly Leu Glu Pro Met Glu Glu 
1220 1225 1230 

ggt gga ggc gag ggt ggt ggg tgt ggt gtt ggc ggc ggt ggg ggt gat 3744 
Gly Gly Gly Glu Gly Gly Gly Cys Gly Val Gly Gly Gly Gly Gly Asp 
1235 1240 1245 

ggt ggt gag gag gee cag acc caa att ggg get aag ggt tea age tct 3792 
Gly Gly Glu Glu Ala Gin Thr Gin He Gly Ala Lys Gly Ser Ser Ser 
1250 1255 1260 

cag gac tct gee atg gag gaa gaa gag caa ggt ggg ggc tea tec age 3840 
Gin Asp Ser Ala Met Glu Glu Glu Glu Gin Gly Gly Gly Ser Ser Ser 
1265 1270 1275 1280 

cca get tta cct gca gaa gaa aac age acc age tag 3876 
Pro Ala Leu Pro Ala Glu Glu Asn Ser Thr Ser 
1285 1290 
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